Emergent Convergence in Multi-Agent LLM Annotation

Parfenova, Angelina, Denzler, Alexander, Pfeffer, Juergen

arXiv.org Artificial Intelligence

Large language models (LLMs) are increasingly deployed in collaborative settings, yet little is known about how they coordinate when treated as black-box agents. We simulate 7,500 multi-agent, multi-round discussions in an inductive coding task, generating over 125,000 utterances that capture both final annotations and their interactional histories. We introduce process-level metrics (code stability, semantic self-consistency, and lexical confidence) alongside sentiment and convergence measures to track coordination dynamics. To probe deeper alignment signals, we analyze the evolving geometry of output embeddings, showing that intrinsic dimensionality declines over rounds, suggesting semantic compression. The results reveal that LLM groups converge lexically and semantically, develop asymmetric influence patterns, and exhibit negotiation-like behaviors despite the absence of explicit role prompting. This work demonstrates how black-box interaction analysis can surface emergent coordination strategies, offering a scalable complement to internal probe-based interpretability methods.
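The declining intrinsic dimensionality signal can be illustrated with a toy participation-ratio estimate on synthetic embeddings. This is a generic PCA-based estimator applied to simulated data, not necessarily the estimator or data used in the paper:

```python
import numpy as np

def participation_ratio(X):
    """PCA-based intrinsic dimensionality estimate: the participation
    ratio (sum l)^2 / sum(l^2) of the covariance eigenvalues l."""
    X = X - X.mean(axis=0)
    lam = np.linalg.svd(X, compute_uv=False) ** 2
    return lam.sum() ** 2 / (lam ** 2).sum()

rng = np.random.default_rng(0)
consensus = rng.normal(size=64)          # a shared "consensus" direction
ids = []
for k in (64, 32, 16, 8, 4):             # later rounds span fewer directions
    basis = rng.normal(size=(k, 64))
    utterances = consensus + rng.normal(size=(30, k)) @ basis
    ids.append(participation_ratio(utterances))

# Dimensionality of each round's outputs shrinks as agents converge.
assert ids[0] > ids[-1]
```

Here "semantic compression" shows up as the embeddings of each round occupying progressively fewer effective directions.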


Towards Benign Memory Forgetting for Selective Multimodal Large Language Model Unlearning

Zeng, Zhen, Gu, Leijiang, Duan, Zhangling, Li, Feng, Shi, Zenglin, Snoek, Cees G. M., Wang, Meng

arXiv.org Artificial Intelligence

Multimodal Large Language Models (MLLMs) achieve remarkable capabilities but can inadvertently memorize privacy-sensitive information. Although existing unlearning methods can remove such knowledge, they fail to achieve benign forgetting because they often degrade the model's general image understanding performance. To address this, we propose the Sculpted Memory Forgetting Adapter (SMFA), which confines forgetting to targeted memory regions while preserving overall capabilities. SMFA first fine-tunes the model to replace sensitive responses with refusals, yielding a memory forgetting adapter, and then applies a retaining anchor-guided masking mechanism to prevent interference with unrelated knowledge and understanding ability. To systematically evaluate selective MLLM unlearning, we introduce S-MLLMUn Bench, the first benchmark designed to jointly assess the removal of sensitive knowledge and retention of general visual understanding. Extensive experiments show that, unlike prior methods, SMFA achieves precise and controllable unlearning while maintaining the model's foundational image understanding.
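The anchor-guided masking idea can be sketched in a heavily simplified form. The function name and the quantile heuristic below are illustrative assumptions, not the paper's algorithm: the sketch zeroes out adapter updates wherever retained ("anchor") data is most sensitive, so forgetting is confined to the remaining coordinates:

```python
import numpy as np

def sculpt_adapter(delta, anchor_grad, keep_ratio=0.5):
    """Toy retaining-anchor-guided masking: `delta` is a weight update
    from refusal fine-tuning; `anchor_grad` is the gradient of a
    retention loss on anchor data at the same weights. Updates are
    kept only where the anchor loss is least sensitive."""
    sensitivity = np.abs(anchor_grad)
    cutoff = np.quantile(sensitivity, keep_ratio)
    mask = sensitivity <= cutoff          # low-sensitivity coordinates
    return delta * mask

rng = np.random.default_rng(1)
delta = rng.normal(size=(4, 8))           # refusal-adapter update
anchor_grad = rng.normal(size=(4, 8))     # retention-loss gradient
sculpted = sculpt_adapter(delta, anchor_grad, keep_ratio=0.5)
# Roughly half of the update entries survive the mask.
assert 0 < np.count_nonzero(sculpted) < delta.size
```

The real method operates on adapter weights of an MLLM rather than toy matrices, but the interference-avoidance logic is the same.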


A Appendix
A.1 Theoretical Proofs

Neural Information Processing Systems

Proposition A.2, Theorems A.4 and A.9, and Lemmas A.5 and A.7 concern the forward pass of the decomposed convolution layer under Assumption 2.6; only fragments of their statements survive here. As shown in Table 3, our method achieves comparable performance among different methods. The fully-connected layer of each model is fine-tuned on the user's local data; the fine-tuning takes about 12 hours on an Nvidia RTX A5000. All the points are below the line given by the bound of Proposition 2.1. Figure 6: the shared coefficients and user-specific atoms represent common knowledge and personalized information; the filter subspace similarity is used to calculate the relations among users, and the correlation can reach 0.985 (models trained on CIFAR-100 are similar among themselves, but they differ from untrained models).


Improving Continual Learning of Knowledge Graph Embeddings via Informed Initialization

Pons, Gerard, Bilalli, Besim, Queralt, Anna

arXiv.org Artificial Intelligence

Many Knowledge Graphs (KGs) are frequently updated, forcing their Knowledge Graph Embeddings (KGEs) to adapt to these changes. To address this problem, continual learning techniques for KGEs incorporate embeddings for new entities while updating the old ones. One necessary step in these methods is the initialization of the embeddings, as an input to the KGE learning process, which can have an important impact on the accuracy of the final embeddings, as well as on the time required to train them. This is especially relevant for relatively small and frequent updates. We propose a novel informed embedding initialization strategy, which can be seamlessly integrated into existing continual learning methods for KGEs, that enhances the acquisition of new knowledge while reducing catastrophic forgetting. Specifically, the KG schema and the previously learned embeddings are utilized to obtain initial representations for the new entities, based on the classes the entities belong to. Our extensive experimental analysis shows that the proposed initialization strategy improves the predictive performance of the resulting KGEs, while also enhancing knowledge retention. Furthermore, our approach accelerates knowledge acquisition, reducing the number of epochs, and therefore the time, required to incrementally learn new embeddings. Finally, its benefits across various types of KGE learning models are demonstrated.
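A minimal sketch of class-informed initialization, assuming the simplest aggregation (the class centroid of previously learned embeddings); the paper may use a more elaborate scheme:

```python
import numpy as np

def informed_init(new_entity_classes, class_of, embeddings, rng=None):
    """A new entity starts at the centroid of learned embeddings of
    entities sharing its schema class; unseen classes fall back to a
    small random vector."""
    rng = rng or np.random.default_rng(0)
    dim = len(next(iter(embeddings.values())))
    centroids = {}
    for ent, vec in embeddings.items():
        centroids.setdefault(class_of[ent], []).append(vec)
    centroids = {c: np.mean(vs, axis=0) for c, vs in centroids.items()}
    return {ent: centroids.get(cls, rng.normal(scale=0.1, size=dim))
            for ent, cls in new_entity_classes.items()}

# Existing KG: two people and one city with learned 3-d embeddings.
learned = {"alice": np.array([1.0, 0.0, 0.0]),
           "bob":   np.array([0.0, 1.0, 0.0]),
           "paris": np.array([0.0, 0.0, 1.0])}
cls = {"alice": "Person", "bob": "Person", "paris": "City"}

init = informed_init({"carol": "Person", "lyon": "City"}, cls, learned)
assert np.allclose(init["carol"], [0.5, 0.5, 0.0])  # Person centroid
```

Starting new entities near semantically related ones gives the incremental KGE training a warm start, which is where the reported epoch savings come from.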



Inner Product-based Neural Network Similarity

Neural Information Processing Systems

Analyzing representational similarity among neural networks (NNs) is essential for interpreting or transferring deep models. In application scenarios where numerous NN models are learned, it becomes crucial to assess model similarities in computationally efficient ways. In this paper, we propose a new paradigm for reducing NN representational similarity to filter subspace distance.
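As a point of reference for the inner-product paradigm, linear CKA is a standard inner product-based representational similarity; the paper's contribution reduces this kind of comparison to a cheaper filter-subspace distance, which is not what the sketch below implements:

```python
import numpy as np

def linear_similarity(X, Y):
    """Inner product-based similarity (linear CKA) between two
    activation matrices of shape (samples, features), in [0, 1]."""
    X = X - X.mean(axis=0)
    Y = Y - Y.mean(axis=0)
    num = np.linalg.norm(X.T @ Y, "fro") ** 2
    den = np.linalg.norm(X.T @ X, "fro") * np.linalg.norm(Y.T @ Y, "fro")
    return num / den

rng = np.random.default_rng(2)
X = rng.normal(size=(100, 16))
R = np.linalg.qr(rng.normal(size=(16, 16)))[0]   # random rotation
# Invariant to orthogonal transforms of the feature space ...
assert np.isclose(linear_similarity(X, X @ R), 1.0)
# ... but low for an unrelated representation.
assert linear_similarity(X, rng.normal(size=(100, 16))) < 0.5
```

The cost of such comparisons grows with the number of model pairs, which motivates replacing them with distances between compact filter subspaces.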


Adaptive Rainfall Forecasting from Multiple Geographical Models Using Matrix Profile and Ensemble Learning

Tran, Dung T., Huyen, Huyen Ngoc, Nguyen, Hong, Phan, Xuan-Vu, Nguyen, Nam-Phong

arXiv.org Artificial Intelligence

Rainfall forecasting in Vietnam is highly challenging due to its diverse climatic conditions and strong geographical variability across river basins, yet accurate and reliable forecasts are vital for flood management, hydropower operation, and disaster preparedness. In this work, we propose a Matrix Profile-based Weighted Ensemble (MPWE), a regime-switching framework that dynamically captures covariant dependencies among multiple geographical model forecasts while incorporating redundancy-aware weighting to balance contributions across models. We evaluate MPWE using rainfall forecasts from eight major basins in Vietnam, spanning forecast horizons of 1 hour and accumulated rainfall over 12, 24, 48, 72, and 84 hours. Experimental results show that MPWE consistently achieves lower mean and standard deviation of prediction errors compared to geographical models and ensemble baselines, demonstrating both improved accuracy and stability across basins and horizons.
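A toy sketch of the matrix-profile weighting idea, under the assumption (ours, not necessarily the paper's) that a model whose recent forecast pattern matches some regime in the observed history deserves more weight; the redundancy-aware part is omitted for brevity:

```python
import numpy as np

def znorm(x):
    s = x.std()
    return (x - x.mean()) / s if s > 0 else x - x.mean()

def profile_min(query, series):
    """Matrix-profile value of `query` against `series`: the smallest
    z-normalized Euclidean distance to any same-length subsequence."""
    m = len(query)
    q = znorm(query)
    return min(np.linalg.norm(q - znorm(series[i:i + m]))
               for i in range(len(series) - m + 1))

def mpwe(forecasts, history):
    """Toy matrix profile-based weighted ensemble: weights are inversely
    proportional to each model's matrix-profile distance to history."""
    dists = np.array([profile_min(f, history) for f in forecasts])
    w = 1.0 / (dists + 1e-6)
    w /= w.sum()
    return w @ np.array([f[-1] for f in forecasts])

rng = np.random.default_rng(3)
pattern = np.array([0.0, 1.0, 3.0, 2.0, 5.0, 1.0, 0.0, 2.0])
history = np.tile(pattern, 20) + 0.05 * rng.normal(size=160)
good = pattern + 0.05 * rng.normal(size=8)   # tracks the observed regime
bad = rng.normal(size=8)                     # unrelated forecast pattern

blend = mpwe([good, bad], history)
# The blend leans toward the regime-following model's latest value.
assert abs(blend - good[-1]) < abs(blend - bad[-1])
```

Because the weights are recomputed from the current window, the ensemble effectively switches regimes as the matching subsequences in the history change.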


Multiple Distribution Shift -- Aerial (MDS-A): A Dataset for Test-Time Error Detection and Model Adaptation

Ngu, Noel, Taparia, Aditya, Simari, Gerardo I., Leiva, Mario, Corcoran, Jack, Senanayake, Ransalu, Shakarian, Paulo, Bastian, Nathaniel D.

arXiv.org Artificial Intelligence

Machine learning models assume that training and test samples are drawn from the same distribution. As such, significant differences between training and test distributions often lead to degradations in performance. We introduce Multiple Distribution Shift -- Aerial (MDS-A) -- a collection of inter-related datasets of the same aerial domain that are perturbed in different ways to better characterize out-of-distribution performance. Specifically, MDS-A is a set of simulated aerial datasets collected under different weather conditions. We include six datasets under different simulated weather conditions along with six baseline object-detection models, as well as several test datasets that are a mix of weather conditions that we show have significant differences from the training data. In this paper, we present characterizations of MDS-A, provide performance results for the baseline machine learning models (on both their specific training datasets and the test data), as well as results of the baselines after employing recent knowledge-engineering error-detection techniques (EDR) thought to improve out-of-distribution performance. The dataset is available at https://lab-v2.github.io/mdsa-dataset-website.


Enhancing Multi-Attribute Fairness in Healthcare Predictive Modeling

Wang, Xiaoyang, Yang, Christopher C.

arXiv.org Artificial Intelligence

Artificial intelligence (AI) systems in healthcare have demonstrated remarkable potential to improve patient outcomes. However, if not designed with fairness in mind, they also carry the risks of perpetuating or exacerbating existing health disparities. Although numerous fairness-enhancing techniques have been proposed, most focus on a single sensitive attribute and neglect the broader impact that optimizing fairness for one attribute may have on the fairness of other sensitive attributes. In this work, we introduce a novel approach to multi-attribute fairness optimization in healthcare AI, tackling fairness concerns across multiple demographic attributes concurrently. Our method follows a two-phase approach: initially optimizing for predictive performance, followed by fine-tuning to achieve fairness across multiple sensitive attributes. We develop our proposed method using two strategies, sequential and simultaneous. Our results show a significant reduction in Equalized Odds Disparity (EOD) for multiple attributes, while maintaining high predictive accuracy. Notably, we demonstrate that single-attribute fairness methods can inadvertently increase disparities in non-targeted attributes, whereas simultaneous multi-attribute optimization achieves more balanced fairness improvements across all attributes. These findings highlight the importance of comprehensive fairness strategies in healthcare AI and offer promising directions for future research in this critical area.
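The EOD metric the abstract refers to can be computed per attribute as the larger of the TPR and FPR gaps between groups. The synthetic classifier below is ours, for illustration only; it shows why auditing a single attribute is insufficient, since each attribute can carry a very different disparity:

```python
import numpy as np

def eod(y_true, y_pred, group):
    """Equalized Odds Disparity for one binary sensitive attribute:
    the larger of the TPR gap and the FPR gap between groups 0 and 1."""
    gaps = []
    for label in (1, 0):  # label=1 gives the TPR gap, label=0 the FPR gap
        r0 = y_pred[(group == 0) & (y_true == label)].mean()
        r1 = y_pred[(group == 1) & (y_true == label)].mean()
        gaps.append(abs(r0 - r1))
    return max(gaps)

rng = np.random.default_rng(4)
n = 2000
y = rng.integers(0, 2, n)
sex = rng.integers(0, 2, n)
race = rng.integers(0, 2, n)
# An 80%-accurate classifier, additionally biased toward predicting
# positives for the sex == 1 group.
y_pred = np.where(rng.random(n) < 0.8, y, 1 - y)
y_pred = np.where((sex == 1) & (rng.random(n) < 0.25), 1, y_pred)

print(f"EOD(sex)  = {eod(y, y_pred, sex):.3f}")
print(f"EOD(race) = {eod(y, y_pred, race):.3f}")
```

A multi-attribute method, in the paper's sense, would fine-tune until this disparity is small for every sensitive attribute simultaneously, not just the one being audited.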